
Tae-Hyun Oh

POSTECH

VSC: Visual Search Compositional Text-to-Image Diffusion Model

May 02, 2025

JointDiT: Enhancing RGB-Depth Joint Modeling with Diffusion Transformers

May 01, 2025

AlignDiT: Multimodal Aligned Diffusion Transformer for Synchronized Speech Generation

Apr 29, 2025

VoiceCraft-Dub: Automated Video Dubbing with Neural Codec Language Models

Apr 03, 2025

Perceptually Accurate 3D Talking Head Generation: New Definitions, Speech-Mesh Representation, and Evaluation Metrics

Mar 27, 2025

FPGS: Feed-Forward Semantic-aware Photorealistic Style Transfer of Large-Scale Gaussian Splatting

Mar 11, 2025

Dr. Splat: Directly Referring 3D Gaussian Splatting via Direct Language Embedding Registration

Feb 23, 2025

Zero-shot Depth Completion via Test-time Alignment with Affine-invariant Depth Prior

Feb 10, 2025

The Devil is in the Details: Simple Remedies for Image-to-LiDAR Representation Learning

Jan 16, 2025

SoundBrush: Sound as a Brush for Visual Scene Editing

Dec 31, 2024